MotifCombinator: A Web Tool to Search for Combinations of Cis-Regulatory Motifs

نویسندگان

  • Mamoru Kato
  • Tatsuhiko Tsunoda
چکیده

Gene expression is controlled by combinatorial regulation of transcription factors and cis-regulatory elements in eukaryotes. Experimental procedures can determine several binding sites for selected transcription factors, but they are too laborious to be applied to large-scale studies. Computational methods are thus required to detect the combinatorial regulation at the genomic level. To reveal the combinatorial regulation, recent studies have developed computational methods to detect significant combinations of patterns (motifs) of cis-regulatory elements, using datasets of upstream sequences and expression levels from DNA microarrays, or binding information from ChIP-on-chip arrays [4]. One widespread type of such computational methods is to find combinations of motifs that specifically appear in upstream sequences of co-regulated genes, which are determined to be expressed by a certain threshold of expression levels. Another type of computational method is based on regression analysis between expression levels and motif scores (occurrence frequencies or weight matrix scores) in input sequences [2,3,6]. This type measures the goodness-of-fit of the regression for candidate motifs, and selects motifs with the best fitting scores. Well-known regression methods for this objective are the (multivariate) linear regression method [2,6] and the multivariate adaptive regression spline (MARS) method [3]. This type of computational method can take full advantage of information about expression levels because the methods do not compulsively dichotomize expression levels into whether genes are expressed or not. However, there is no web-based tool to systematically search motif combinations based on these regression methods. Moreover, these methods were developed mainly for simple eukaryotes like Saccharomyces cerevisiae; therefore, there are some limitations when they are applied to higher organisms with complex regulatory systems such as mammals. For example, they cannot practically handle combinations composed of more than two or three motifs. We implemented MotifCombinator, a web-based tool that can systematically search combinations of regulatory motifs based on regression methods. This tool is equipped with the two types of regression methods (the multivariate linear regression and MARS), and moreover it includes the logistic regression. It also employs the genetic algorithm to search combinations composed of more than two or three, or an arbitrary number of motifs. MotifCombinator will serve users to find combinations of regulatory motifs in organisms with complex regulatory systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

COMPASSS (COMplex PAttern of Sequence Search Software), a simple and effective tool for mining complex motifs in whole genomes

MOTIVATION The complete sequencing of the human genome shows that only 1% of the entire genome encodes for proteins. The major part of the genome is made up of non-coding DNA, regulatory elements and junk DNA. Transcriptional regulation plays a central role in a multitude of critical cellular processes and responses, and it is a central force in the development and differentiation of multicellu...

متن کامل

CREME: Cis-Regulatory Module Explorer for the human genome

The binding of transcription factors to specific regulatory sequence elements is a primary mechanism for controlling gene transcription. Eukaryotic genes are often regulated by several transcription factors whose binding sites are tightly clustered and form cis-regulatory modules. In this paper, we present a web server, CREME, for identifying and visualizing cis-regulatory modules in the promot...

متن کامل

Genome surveyor 2.0: cis-regulatory analysis in Drosophila

Genome Surveyor 2.0 is a web-based tool for discovery and analysis of cis-regulatory elements in Drosophila, built on top of the GBrowse genome browser for convenient visualization. Genome Surveyor was developed as a tool for predicting transcription factor (TF) binding targets and cis-regulatory modules (CRMs/enhancers), based on motifs representing experimentally determined DNA binding specif...

متن کامل

Plant cis-acting regulatory DNA elements (PLACE) database: 1999

PLACE (http://www.dna.affrc.go.jp/htdocs/PLACE/) is a database of nucleotide sequence motifs found in plant cis-acting regulatory DNA elements. Motifs were extracted from previously published reports on genes in vascular plants. In addition to the motifs originally reported, their variations in other genes or in other plant species in later reports are also compiled. Documents for each motif in...

متن کامل

SIREs: searching for iron-responsive elements

The iron regulatory protein/iron-responsive element regulatory system plays a crucial role in the post-transcriptional regulation of gene expression and its disruption results in human disease. IREs are cis-acting regulatory motifs present in mRNAs that encode proteins involved in iron metabolism. They function as binding sites for two related trans-acting factors, namely the IRP-1 and -2. Amon...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006